Private Information Disclosure from Web Searches

نویسندگان

  • Claude Castelluccia
  • Emiliano De Cristofaro
  • Daniele Perito
چکیده

As the amount of personal information stored at remote service providers increases, so does the danger of data theft. When connections to remote services are made in the clear and authenticated sessions are kept using HTTP cookies, data theft becomes extremely easy to achieve. In this paper, we study the architecture of the world’s largest service provider, i.e., Google. First, with the exception of a few services that can only be accessed over HTTPS (e.g., Gmail), we find that many Google services are still vulnerable to simple session hijacking. Next, we present the Historiographer, a novel attack that reconstructs the web search history of Google users, i.e., Google’s Web History, even though such a service is supposedly protected from session hijacking by a stricter access control policy. The Historiographer uses a reconstruction technique inferring search history from the personalized suggestions fed by the Google search engine. We validate our technique through experiments conducted over real network traffic and discuss possible countermeasures. Our attacks are general and not only specific to Google, and highlight privacy concerns of mixed architectures using both secure and insecure connections. Update: Our report was sent to Google on February 23rd, 2010. Google is investigating the problem and has decided to temporarily suspend search suggestions from Search History. Furthermore, Google Web History page is now offered over HTTPS only. Updated information about this project is available at: http://planete.inrialpes.fr/projects/private-information-disclosure-from-web-searches

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Private Information Disclosure from Web Searches. (The case of Google Web History)

As the amount of personal information stored at remote service providers increases, so does the danger of data theft. When connections to remote services are made in the clear and authenticated sessions are kept using HTTP cookies, data theft becomes extremely easy to achieve. In this paper, we study the architecture of the world’s largest service provider, i.e., Google. First, with the excepti...

متن کامل

Block ownership and information disclosure in privatized firms-Evidence of Web disclosure from China

This paper examines whether the different types of block shareholdings will have a different impact on the extent of Web voluntary disclosure during the differential privatization stages. Prior literature suggests that block ownership may have a substitutive or complementary monitoring effect on corporate disclosure. However, for economies transferring from state endowment to being privately he...

متن کامل

Towards Enforceable Data-Driven Privacy Policies

A defining characteristic of current web applications is that they are personalized according to the interests and preferences of individual users; popular examples are Google News and Amazon.com. While this paradigm shift is generally viewed as positive by both users and content providers, it introduces privacy concerns, as the data needed to drive this functionality is often considered privat...

متن کامل

DCNL: Disclosure Control of Natural Language Information to Enable Secure and Enjoyable E-Communications

Natural language communications using social networking and blogging services can result in the undesired revelation of private information. Existing disclosure control is tedious and error-prone because the user must set the disclosure level manually and must reconsider the level every time a new text is to be uploaded. This can lead to the revelation of private information or reduced enjoymen...

متن کامل

Polynomial-time Attack on Output Perturbation Sanitizers for Real-valued Databases

Output Perturbation is one of several strategies in the area of Statistical Disclosure Control (SDC), also known as Private Data Analysis. The general problem in SDC consists of releasing valuable information about individuals in a databasewhile preserving their privacy. Examples of this include databases containing health information about patients, customer electronic transactions, and web br...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010